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1. Introduction 



Consider a sample Y\, . . . ,Y n from a p-dimensional multivariate distribution 
with covariance matrix E p . An important problem in multivariate analysis is 
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to test the sphericity of such a multivariate distribution, namely the hypothesis 
Hq : I] p = (T 2 I p where a 2 is unspecified. If the observations represent a mul- 
tivariate error with p components, the null hypothesis expresses the fact that 
the error is cross-sectionally uncorrelated (independent if in addition they are 
normal) and have a same variance (homoscedasticity). 

Much of the ex i sting theory about this test has been exp osed first in d etails 



Mauchly. J.W 



(197: 



Sugiu ra. N 



Anderson 



10) 



(1940) about Gaussian LRT and later in 



J 1972) and also in textbooks like 



Muirhead 



John 



John! (119711) 

19821 . Chapter 



19841 Chapter 10). Assume for a moment that the sample has 



8) and 

a normal distribution with covariance matrix T, p (as far as the covariance matrix 
S p is concerned, we will assume in all the paper that the mean is zero). Let be 



the sample covariance matrix S n 



J2i YiY* and denote its eigenvalues by 



{£i}i<i< P - Two well established procedures for t e sting the sphericity are the 



likelihood ratio tes t and a t e st de vised in 
Anderson! jl984l . §10.7.2), 



statistic is, see e.g. 



John! (|197ll ). The likelihood ratio 



L, 



which is a power of the ratio of the geometric mean of the sample eigenvalues 
to the arithmetic mean. It is here noticed that in this formula it is necessary to 
assume that p < n to avoid null eigenvalues in (the numerator of) L n . If we let 
n — > oo while keeping p fixed, classical asymptotic theory indicates that under 
the null hypothesis, — 21ogL n ==> \p a chi-square distribution with degree of 
freedom / = hp(p + 1) — 1. This asymptotic distribution is further refined by 
the following Box-Bartlett correction: 

P{-2p logi n < x) = P f (x) + lo 2 {Pf+i{x) - P f (x)} + 0(n- 3 ) , 



where Pk{x) = P{x\ < x ) an d 



P = l- 



2p 2 +p + 2 
6pn 



0J2 = 



+ 2)(p - l)(p - 2){2p 3 + 6p 2 + 3p + 2) 



2 n 2 p 2 



The test procedure based on this correction will be hereafter referred as the 
LRT test. 
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By observing that t he asymptoti c variance of — 21ogL„ is proportional to 



trjE^rS)- 1 



John! ([19711 ) proposed to use the statistic 



T 2 = ^tr{S n (trS n )- 1 -p- 1 I p } 2 . 

It is observed that T 2 is proportional to the square of the coefficient of variation 
of the sample eigenvalues, namely 



To 



with £ 



n — ^ 



Following the idea of the Box-Bartlett correction, iNagaol ([19731 ) established an 
expansion for the distribution function of the statistics T 2 , 

P(T 2 <x) = P f {x) + - {a p P f+6 (x) + b p P f+4 (x) + c p P f+2 (x) + d p P f (x)} 



+0(n- 2 ), 



(1.1) 



where 

a P = 7z(P 3 + 3P 2 - 12 - 200P- 1 ), b p - l{-2p 3 - bp 2 + 7p - 12 - 420JT 1 ), 
c p = J(p 3 + 2p 2 - p - 2 - 216P" 1 ), dp = ^(-2P 3 -3p 2 + P + 436P- 1 ). 

The first purpose of the paper is to give an analysis of the LRT and John's 
test for sphericity of large-dimensional observations. It has been well-known that 
classical multivariate procedures are in general challenged by large-dimensional 



find in 


Johnstone and Titterington 


(2009 


) for a review and in 


Ledoit and Woli 




(2002 


)• 
)• 


Srivastava 


2005 


). 


Srivastava and Fuiikoshi 


(2006 


U 


jchott 


(2007 


). 


Bai et al. 


(2009 


Chen et al. 


(2010) 


. Chen and Qin 


(2010 


U 


Bai et al. 


(2012 


for some rep- 



resentative developments on the topic. 

Let us first examine the behavior of the LRT and John's test for large- 
dimensional data. Empirical sizes of these tests from a Monte-Carlo study are 
reported in Table [T] where normal vectors with mean zero and identity covariance 
matrix are used. The sample size is set to n = 64 while dimension p increases 
from 4 to 60 (we have also run other experiments with larger sample sizes n but 
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Table 1 

Empirical sizes of LRT and John's test at 5% significance level based on 10000 independent 
replications using normal vectors J\f(0,I p ) for n = 64 and different values of p. 



(p,n) 


(4,64) 


(8,64) 


(16,64) 


(32,64) 


(48,64) 


(56,64) 


(60,64) 


LRT 


0.0483 


0.0480 


0.0489 


0.0520 


0.0814 


0.2922 


0.6673 


Nagao's test 


0.0468 


0.0492 


0.0474 


0.0487 


0.0494 


0.0507 


0.0535 



observations are very similar). The nominal level equals a — 0.05 and empirical 
sizes are computed from 10000 independent replications for each pair of (p,n). 

We observe that the Box-Bartlett correction of LRT remains efficient for 
dimensions p < n/2 approximatively and, for larger p the test quickly becomes 
inconsistent with empirical sizes much larger than the nominal level (%5). In 
contrast, John's test is remarkably robust against the dimension inflation. 

In this paper, we propose new corrections to both the LRT and John's test 
which are consistent within the large-dimensional context. These corrections 
are valid for general observations with finite fourth moments (not necessarily 
normal). In the case of John's test, our findings give a new con firmation of its 



Ledoit and Wold (|2002l ) but under the 



robustness, a result previously found in 
normal assumption. 

From the technical point of view, our approach d iffers from 



Ledoit and Woll 



Bai et al 



2012). The 



(2002J) and follows the one devised in lBai et al.1 (|2009l ) and 
central tool is a CLT for linear spectral sta tistics of sample co variance matri- 



ces established in 



Bai and Silversteinl ( 20041 ) and later refined in 



Pan and Zhou 



(|2008l ). The paper also contains an original contribution on this CLT reported 
in the Appendix: new formula for the limiting parameters in the CLT are estab- 
lished. Since such CLT's are increasingly important in large-dimensional statis- 
tics, we believe that these new formula will be of independent interests for 
applications other than those considered in the paper. 

The remaining of the paper is organized as follows. Large-dimensional cor- 
rections to the LRT and John's test are introduced in Section[2] Next, Section[3] 
reports a detailed Monte-Carlo study to analyze finite-sample sizes and powers of 
these two corrected large-dimensional tests under both normal and non-normal 
distributed data. Technical proofs and calculations are relegated to Section |4j 
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The last section contains some concluding remarks. 
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2. Large-dimensional corrections 

From now on, we assume that the observations Y\ , . . . , Y n have the represen- 

1 /2 

tation Yj = H p ' Xj where the pxn table {Xi, . . . , X n } = {x i3 ;}i<i< p ,i<j<„ are 
made with an array of i.i.d. standardized random variables (mean and vari- 
ance 1). This setting is motivated by the random matrix theory and it is generic 
enough for a precise analysis of the sphericity test. Furthermore, under the null 
hypothesis H n : E p = a 2 I p (a 2 is unspecified), we notice that both the LRT and 
John's test are invariant tests. In particular, the test statistics are independent 
from the scale parameter a 2 under the null. Therefore, we can assume w.l.o.g. 
a 2 = 1 when dealing with the null distributions of these test statistics. This will 
be assumed in all the sections. 

Throughout the paper we will use an indicator k set to 2 when {xij} are real 
and to 1 when they are complex. Define the kurtosis coefficient /? — E\xi } .\ 4 — 1 — n 
for both cases and note that for normal variables, j3 = 0. 

2.1. The corrected likelihood ratio test 

For the LRT, let C n = —2n~ 1 logi„ for n > 1. Our first main result is the 
following. 

Theorem 2.1. Assume {x^} are iid, satisfying Exij — 7 E\xij\ 2 = 1, E\xij | 4 < 
oo. Then under Hq and when £ = y n — > y e (0, 1), 

£ n + (p-n)-log(l--)-p N{-?—^\og{l-y) + )-(}y, -n\og{l -y)-ny} . 

The test based on this asymptotic normal distribution will be hereafter re- 
ferred as the corrected likelihood-ratio test (CLRT). One may observe that the 
limiting distribution of the test crucially depends on the limiting dimension-to- 
sample ratio y through the factor — log(l — y). In particular, the asymptotic 
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variance will blow up quickly when y approaches 1. In this case, it is expected 
that the power of the test will seriously break down. Monte-Carlo experiments 
in Section [3] will provide more details on this behavior. 

The proof of Theorem 12.11 is based on the following lemma. In all the fol- 
lowing, F y denotes the Marcenko-Pastur distribution of index y (> 0) which is 
introduced and discussed in the Appendix. 

Lemma 2.1. Let {£i}i<i< p be the eigenvalues of the sample covariance matrix 
S n = n ^i^i* ■ Then under Hq and the conditions of Theorem \2.1\ we have 



Proof, (of Theorem EH). Let A n = £Li lo S^ - pF** (log x) and B„ = 
ELi U ~ pF Vn {x). By LemmaEU 



Consequently, —A n + B„ is asymptotically normal with mean — ^—^ log(l — y) + 




with 




and 





\{iy and variance 



V 1 (l,l) + Vi(2,2)-2Vi(l,2) 



Klog(l - y) - ny. 
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Besides, 



r 



-£? =1 log4+plog(-£? =1 4) 

- ( A n + P F y " (log x) ) + p log( - (B n + p) ) 

P 

-A„ - P F»«Qagx) +plog(l + ^). 

P 



Since B n => N(0,y{fi + K)), B n = O p (l) and log(l + B n /p) = B n /p + O p (l/p 2 ). 



Therefore, 



Cn = -A n - P F y - (log x) + B n + O p { 



k 

P 



The conclusion follows with the following well-known integrals w.r.t. the Marcenko- 
Pastur distribution F y (y < 1), 



F y (logx) 



y 



log(l-y)-l, F y (x) = l. 



The proof of Theorem 12.11 is complete 



□ 



2.2. The corrected John's test 



John 



Earlier than the asymptotic expansion (|1.1[> given in iNagaol (|1973l ) 
( 19711 ) proved that when the observations are normal, the spherici ty test based 
on T 2 is a locally most powerful invariant test. He also established in I John! (|1972l ) 
that under these conditions, the limiting distribution of T 2 under Hq is xj with 
degree of freedom / = \p{p + 1) — 1, or equivalently, 

2 o 



nU — p 



P 



Xf-P, 



where for convenience in this section, we have let U = 2{np)~ 1 T 2 . Clearly, this 
limit has been established for n — ¥ oo and a fixed dimension p. However, if we 
now let p — > oo in the right-hand side of the above result, it is not hard to see 
that Xf~ P w iH tend to the normal distribution N(l, 4). It then seems "natural" 
to conjecture that when both p and n grow to infinity in some "proper" way, it 
may happen that 



»Z7-p=>iV(l,4) 



(2.1) 
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This is indeed the main result of 



Ledoit and Woli (|2002j ) where this asymptotic 



distribution was established assuming that data are normal-distributed and p 
and n grow to infinity in a proportional way (i.e. p/n — > y > 0). 

In this section, we provide a more general result using our own approach. In 
particular, the distribution of the observation is arbitrary provide a finite fourth 
moment exists. 

Theorem 2.2. Assume {xij} are iid, satisfying Exij = 0,E\xij\ 2 — l,E\xij\ < 
oo, and let U — 2(np)~ 1 T2- Then under Hq and when p — > oo, n oo, ^ = y n — > 
y e (0,oo), 

nU-p-K-[3^ N(-l, 2k) . 

A striking fact in this theorem is that contrary to the limiting distribution 
for the corrected LRT, the limiting distribution of John's test is independent of 
the dimension to sample ratio y = \\mp/n. In this sense, Theorem 12.21 gives a 
theoretic explanation to the widely observed robustness of John's test against 
the dimension inflation. Moreover, John's test is also valid for the p larger (or 
much larger) than n case in contrast to the LRT where this ratio should be kept 
smaller than 1 to avoid null eigenvalues. 

It is also worth noticing that for real normal data, we have n = 2 and ft = 
so that the theorem above reduces to nil — p => N (l,4). This is exactly the 
result discussed above from iLedoit and Wola (|2002l ). Besides, if the data has a 



non-normal distribution but having the same first four moments as the normal 
distribution, we have again nU — p =>■ N(l, 4). 

The test based on the asymptotic normal distribution given in Theorem 12.21 
will be hereafter referred as the corrected John's test. Note that in practice for 
the application of the test with real data, we may estimate the parameter /? 
using the fourth-order sample moment: 

p n 

Up ■ 



p n 



»=i 3=1 

The proof of Theorcm l2.2l is based on the following lemma. 
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Lemma 2.2. Let {£i}i<i< p be the eigenvalues of the sample covariance matrix 
S n = n~ x J2i YiY*. Then under Hq and the conditions of Theorem VZ.'A we have 



with 



and 



fJ-2 







Vi = 



2 K y 2 + 4(/s + B)(y + 2y 2 + y 3 ) 2(/s + (3){y + y 2 ) 
2{n + B){y + y 2 ) (K + /3)y 

The proof of this Lemma is postponed to Section |U 

Proof, (of Theorem I2.2[) . The result of Lemma \2. 21 can be rewritten as: 



Define the function f(x,y) = % - 1, then U = /(p^E^?, P _1 E? = i*i). 
We have 

^(i + £ + ^±^,i) = i, 

ox n n 

df , p k + B - 1 s , p K + 

/ 1+- + ,1 ="2 1+- + ) , 

ay n n n n 

n n n n 



By the delta method, 

+ K + /? ~ 1 ,1) N ) =^M0,limC), 
V n n / 




0a; v 1 n 1 n 1 ' 

f£(l + £ + «±£=i I) 

oil y n n 1 ' 
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2k 



Therefore, 



n(U-]L- K + P ~ 1 )^N{Q,2k) 



n n 



that is, 



nU-p-K-/3=> N(-l, 2k) 



The proof of Theorem 12.21 is complete 



□ 



3. Monte Carlo study 



Monte Carlo simulations are conducted to find empirical sizes and powers of 
the corrected likelihood ratio test (CLRT) and the corrected John's test (CJ). 
In particular, here we want to examine the following questions: how robust 
are the tests against non-normal distributed data and what is the range of the 
dimension to sa mple ratio p/n wh ere the tests are applicable? 



More recently, 



ity. Further in the paper 



Srivastaval (I2OO5L puts forward a statistic T% to test the spheric- 



Srivastava et al 



(|201ll ) , they exam th e performance of 



Srivastava et al 



T\ un der non-normality. As the condition 7 = 3 + 0{p~ e ) in 
(|201 If) reduces to 7 — > 3 when the dimension p tends to infinity, which is a re- 
striction of the fourth moment matching the normal distribution. Besides, their 
result is the same as : nU — p — 1 => iV(0, 4) in the normal case (a small per- 
turbation is allowed), which is equivalent to LW's result. So we do not compare 
th eir result wi t h our ones in the following. 



Chen et al 



(2010 ) propose another tes t statistic to test the sphericity fol- 



lowing the idea of iLedoit and WolJ (|2002h but meanwhile they consider some 
higher order terms in order to estimate tr(E) and ir(£ 2 ). 



Let 
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i=l » i# 



I 1 * 

5"3,n = pa ' Yi ' n "mE X i X j X j X k 

1 

*5,n = pi X i X 3 X k X l > 



and define 



then under the sphericity hypothesis, they prove that 



(^)-n=^JV(0,4) (3.1) 



without distributional assumption for the data, which is exactly the model we 
are considering here and we refer to this test as C test. 

For comparison, we show both the performance of the LW test using the 
asymptotic N(l, 4) distribution in (|2.1I) (Notice however this is the CJ test under 
normal distributions) and the C test using the asymptotic iV(0,4) distribution 
in (|3.1|) . The nominal test level is set to be a = 0.05, and for each pair of (p, n), 
we run 10000 independent replications. 

We consider two scenarios with respect to the random vectors Yi : 

(a) Yi is p-dimensional real random vector from the normal population iV(0, I p ) . 
In this case, k = 2 and (3 = 0. 

(b) Yi consists of iid real random variables with distribution F(4, 2) — 2 so that 
yij satisfies Eyij = 0,Eyf, =4.5. In this case, K = 2 and j3 = 1.5. 



Table [2] reports the sizes of the four statistics in these two scenarios for 
different values of (p, n). We see that when {yij} are normal, both LW (=CJ), 
CLRT and C have similar empirical sizes tending to the nominal level 0.05 as 
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Table 2 

Empirical sizes of LW, CJ, CLRT and C test at 5% significance level based on 10000 
independent applications with real N(0, 1) random variables and with real Gamma(4,2)-2 

random variables. 



(p, n) 




N(0, 1) 






Gamma(4,2)-2 




LW/CJ 


CLRT 


C 


LW 


CLRT 


CJ 


C 


(4,64) 


0.0424 


0.0472 


0.494 


0.1102 


0.1566 


0.1577 


0.1469 


(8,64) 


0.0546 


0.0577 


0.0594 


0.131 


0.1136 


0.1234 


0.1139 


(16,64) 


0.0538 


0.056 


0.0589 


0.1373 


0.0836 


0.1002 


0.0896 


(32,64) 


0.0533 


0.0553 


0.0586 


0.134 


0.0635 


0.0804 


0.0709 


(48,64) 


0.0527 


0.0507 


0.0559 


0.1343 


0.0608 


0.0781 


0.0665 


(56,64) 


0.0542 


0.0533 


0.0603 


0.1301 


0.0603 


0.0761 


0.0657 


(60,64) 


0.0558 


0.0589 


0.0576 


0.1306 


0.059 


0.0729 


0.0681 


(8,128) 


0.0547 


0.055 


0.0584 


0.134 


0.1165 


0.1198 


0.1125 


(16,128) 


0.0523 


0.0519 


0.054 


0.1343 


0.0862 


0.0954 


0.0891 


(32,128) 


0.0548 


0.0532 


0.0566 


0.1347 


0.0691 


0.0819 


0.0768 


(64,128) 


0.0555 


0.0556 


0.0575 


0.1254 


0.0572 


0.0705 


0.0655 


(96,128) 


0.0532 


0.0554 


0.0557 


0.1226 


0.0521 


0.0626 


0.0608 


(112,128) 


0.0522 


0.0518 


0.0553 


0.123 


0.0532 


0.0632 


0.0618 


(120,128) 


0.0545 


0.0554 


0.0557 


0.1221 


0.0535 


0.0625 


0.0623 


(16,256) 


0.0577 


0.0587 


0.0598 


0.1324 


0.0869 


0.0919 


0.0912 


(32,256) 


0.0579 


0.0552 


0.0583 


0.1272 


0.0668 


0.071 


0.0691 


(64,256) 


0.0512 


0.0553 


0.0523 


0.1256 


0.058 


0.0618 


0.0596 


(128,256) 


0.0523 


0.0517 


0.0533 


0.1216 


0.0565 


0.0555 


0.0557 


(192,256) 


0.0508 


0.0522 


0.0523 


0.12 


0.0549 


0.0524 


0.0508 


(224,256) 


0.0494 


0.052 


0.051 


0.1169 


0.053 


0.0537 


0.0532 


(240,256) 


0.0496 


0.052 


0.051 


0.1166 


0.0532 


0.0517 


0.0525 


(32,512) 


0.0543 


0.0555 


0.0558 


0.1234 


0.0663 


0.069 


0.0676 


(64,512) 


0.0534 


0.052 


0.0542 


0.1212 


0.0567 


0.0594 


0.0589 


(128,512) 


0.053 


0.0502 


0.0527 


0.1147 


0.0544 


0.0529 


0.0529 


(256,512) 


0.0522 


0.0522 


0.0523 


0.1152 


0.0558 


0.0558 


0.0544 


(384,512) 


0.0484 


0.0493 


0.05 


0.1159 


0.0528 


0.0524 


0.0503 


(448,512) 


0.0514 


0.0497 


0.0514 


0.1143 


0.0511 


0.0511 


0.0514 


(480,512) 


0.0529 


0.0501 


0.054 


0.1116 


0.0502 


0.0549 


0.0521 
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either p or n increases. But when {yij} are Gamma-distributed, the sizes of the 
statistic LW arc higher than 0.1 no matter how large the values of p and n are 
while the sizes of CLRT and CJ all converge to the nominal level 0.05 as either p 
or n gets larger . This empirically confi rms that normal assumptions are needed 



for the result of 



Ledoit and Wolfl ((2002) while our corrected statistics CLRT and 



CJ (also the C test) have no such distributional restriction. 

Besides, for small values of n, such as n = 64 or n = 128, the sizes of CLRT 
are a little bit closer to 0.05 than those of CJ, the main reason is that in the 
proof of Theorem 12. 2[ we take the limit of C (the asymptotic variance there) , 
in which we regard the 0(l/yn 2 ) term as o(l). So for any y G (0, oo), the larger 
the value of n, the better the performance is. Another way of saying is that CJ 
is more sensitive to small values of n. Consider the same procedure we do in the 
proof of Theorem l2.1l for CLRT, where we regard log(l + B n /p) as B n /p plus a 
small term 0(l/p 2 ), and this small term is eventually discarded for large values 
of p. 

As for empirical powers, we consider two alternatives : 

(1) S p is diagonal with half of its diagonal elements 0.5 and half 1. We denote 
its power by Power 1 ; 

(2) E p is diagonal with 1/4 of the elements equal 0.5 and 3/4 equal 1. We 
denote its power by Power 2 . 

Table [3] reports the powers of LW(=CJ), CLRT and C when {y^} are dis- 
tributed as iV(0, 1), and of CJ, CLRT and C when {y^} are distributed as 
Gamma(4,2)-2, for the situation when n equals 64 or 128, with varying values 
of p and under the above mentioned two alternatives. For n = 256 and p varying 
from 16 to 240, all the tests have powers around 1 under both alternatives so 
that these values are omitted. And in order to find the trend of these powers, 
we also present the results when n = 128 in Figure [1] and Figure [2j 

The behavior of Power 1 and Power 2 in each figure related to the three 
statistics are similar, except that Power 1 is much higher compared with Power 
2 for a given dimension design (p, n) and any given test for the reason that the 
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Table 3 

Empirical powers of LW, CJ, CLRT and C test at 5% significance level based on 10000 
independent applications with real N(0, 1) random variables and with real Gamma(4,2)-2 
random variables under two alternatives Power 1 and 2 (see the text for details). 



N(0, 1) 


(p,n) 




Power 1 






Power 2 




LW/CJ 


CLRT 


C 


LW/CJ 


CLRT 


C 


(4,64) 


0.7172 


0.7411 


0.711 


0.3814 


0.5255 


0.3805 


(8,64) 


0.8123 


0.8227 


0.8042 


0.4303 


0.5844 


0.4278 


(16,64) 


0.8645 


0.8547 


0.8523 


0.4597 


0.5908 


0.4592 


(32,64) 


0.8991 


0.811 


0.89 


0.475 


0.5084 


0.4715 


(48,64) 


0.9092 


0.6751 


0.8992 


0.4825 


0.3821 


0.4834 


(56,64) 


0.9111 


0.5477 


0.9021 


0.4908 


0.2919 


0.4842 


(60,64) 


0.9112 


0.4345 


0.9014 


0.4908 


0.2312 


0.4841 


(8,128) 


0.9972 


0.9971 


0.997 


0.9071 


0.9639 


0.9032 


(16,128) 


0.9997 


0.9998 


0.9998 


0.9427 


0.9863 


0.9392 


(32,128) 


1 


1 


1 


0.9556 


0.9894 


0.9525 


(64,128) 


1 


1 


1 


0.9629 


0.9773 


0.9609 


(96,128) 


1 


0.9989 


1 


0.9644 


0.9143 


0.9621 


(112,128) 


1 


0.9863 


0.9999 


0.9649 


0.8065 


0.9598 


(120,128) 


1 


0.9522 


1 


0.9672 


0.695 


0.9644 



Gamma(A, 2) — 2 



(p,n) 




Power 1 






Power 2 




CJ 


CLRT 


C 


CJ 


CLRT 


C 


(4,64) 


0.5996 


0.6335 


0.6164 


0.3478 


0.4708 


0.3649 


(8,64) 


0.714 


0.7379 


0.7226 


0.3996 


0.5189 


0.4085 


(16,64) 


0.7912 


0.7829 


0.7921 


0.4396 


0.5439 


0.4406 


(32,64) 


0.8559 


0.7712 


0.8541 


0.4815 


0.4913 


0.4663 


(48,64) 


0.8837 


0.6587 


0.8777 


0.4904 


0.3755 


0.4788 


(56,64) 


0.8838 


0.5371 


0.88 


0.4899 


0.2941 


0.4791 


(60,64) 


0.8856 


0.4339 


0.884 


0.4876 


0.2368 


0.4763 


(8,128) 


0.9789 


0.9824 


0.9796 


0.8239 


0.9021 


0.8301 


(16,128) 


0.9971 


0.9976 


0.9973 


0.8901 


0.9537 


0.8919 


(32,128) 


0.9996 


0.9998 


0.9997 


0.9286 


0.9745 


0.9275 


(64,128) 


1 


0.9997 


1 


0.9495 


0.9672 


0.9488 


(96,128) 


0.9999 


0.9977 


1 


0.9586 


0.9037 


0.9552 


(112,128) 


1 


0.983 


1 


0.9606 


0.8026 


0.9558 


(120,128) 


1 


0.9482 


1 


0.9617 


0.6824 


0.9567 
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first alternative differs more from the null than the second one. The powers of 
LW (in the normal case), CJ (in the Gamma case) and C are all monotonically 
increasing in p for a given fixed value of n. But for CLRT, when n is fixed, the 
powers first increase in p and then become decreasing when p is getting close 
to n. This can be explained by the fact that when p is close to n, some of the 
eigenvalues of S n are getting close to zero, causing the CLRT nearly degenerate 
and losing power. 

Besides, from these figures, we can find that in the normal case the trend 
of C's power is very much alike of those of LW while in the Gamma case it is 
similar with those of CJ under both these two alternatives. And in most of the 
cases, the power of C test is slightly lower than LW (in the normal case) and 
CJ (in the Gamma case). 



N(0,1) n=128 N(0,1) 11=128 




20 40 60 80 100 120 20 40 60 80 100 120 

P P 



Figure 1: Empirical powers of LW/CJ, CLRT and C test at 5% significance level 
based on 10000 independent applications with real N(0,1) random variables for 
fixed n = 128 under two alternatives Power 1 and 2 (see the text for details). 



Lastly, we examine the performance of C J and C when p is larger than n and 
empirical sizes and powers are presented in Table |4j We choose the variables 
to b e distributed as Gamm a(4,2)-2 since CJ reduces to LW in the normal case, 
and iLedoit and Woli (|2002n has already reported the performance of LW when 
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Gamma(4.2)-2 n=128 



Gamma(4,2)-2 n=128 




CJ 

CLRT 

C 



20 40 



80 100 120 




Figure 2: Empirical powers of C J, CLRT and C test at 5% significance level based 
on 10000 independent applications with real Gamma(4,2)-2 random variables 
for fixed n = 128 under two alternatives Power 1 and 2 (see the text for details). 



p is larger than n. From the table, we see that when p is larger than n, the size 
of CJ is still correct and it converges to the nominal level 0.05 as the dimension 
p increases and the same phenomenon exists for C test. 

When we evaluate the power, the same two alternatives Power 1 and Power 
2 as above are considered. The sample size is fixed to n = 64 and the ratio p/n 
varies from 1 to 20. From the table, we see that Power 1 are much higher than 
Power 2 for the same reason that the first alternative is easier to be distinguished 
from Hq. Besides, the powers under both alternatives all increase monotonically 
(although quite slowly in the case of Power 2) as the dimension p increases 
(however the power has a small drop in the last case of (p, n) — (1280, 64) and 
we are suspecting that this is due to some numerical inaccuracy of eigenvalue 
computation for large p). Besides, the powers of CJ are always slightly higher 
than those of C in this "large p small n" setting. 

An interesting and somewhat surprising phenomenon here is that although 
asymptotic distribution for the CLRT and CJ are derived under the "Marcenko- 
Pasture scheme" (i.e p/n — > c > 0), the simulation results confirm that even 
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for "large p small n" situation such as (p, n) = (1280, 64), the tests still behave 
well, which turns out to be free from the assumption constraint "p/n c" . 

Summarizing all these findings from this Monte-Carlo study, the overall figure 
is the following: when the ratio p/n is much lower than 1 (say smaller than 1/2), 
it is preferable to employ CLRT (than CJ, LW or C); while this ratio is higher, 
CJ (or LW for normal data) becomes more powerful (slightly more powerful 
than C). 



Table 4 

Empirical sizes and powers (Power 1 and 2) of CJ test and C test at 5% significance level 
based on 10000 independent applications with real Gamma(4,2)-2 random variables when 

V 1— n - 



(p, n) 




CJ 






C 




Size 


Power 1 


Power 2 


Size 


Power 1 


Power 2 


(64,64) 


0.0732 


0.8883 


0.4869 


0.069 


0.8838 


0.4736 


(320,64) 


0.0631 


0.9208 


0.501 


0.0587 


0.9129 


0.4891 


(640,64) 


0.0587 


0.9296 


0.5141 


0.0552 


0.9186 


0.493 


(960,64) 


0.0579 


0.9315 


0.5143 


0.0544 


0.9202 


0.5006 


(1280,64) 


0.0599 


0.9284 


0.5088 


0.0544 


0.9178 


0.4916 



4. Additional proofs 

4 ■ 1 ■ Proof of Lemma \2.1\ 

Let for x > 0, f(x) — logx and g(x) = x. Define A n and B n by the decom- 
positions 

$>g 4 = pi f(x)d(F n (x) - (a;)) + P F»- (/) = A n + pF»« (/) , 
p f 

= P J g{x)d{F n (x)-F«»(x))+pF»"b) = B n +pF»»( 3 ) . 



i=l 



Applying Theorem 12.11 given in the Appendix to the pair (f,g), we have 

An \ ^ N (( EX f \ t C ™(Xf,Xf) Cov(A7,A 9 ) 
B n \\ EX g ) \CoY(X g ,X f ) Cov{X g ,X g ) 
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IN 



It remains to evaluate the limiting parameters and this results from the following 
calculations: 



h(f,r) 
h(g,r) 
W) 
h{g) 
Ji{f,9,r) 

M9,9,r) 
Mf,9) 
Mf,f) 

M9,g) 



i 



2 /„2\ 



= -log{l- h'/r 
= 0, 

= 

2 ' 
= 0, 

a. 

= --log(l -h*/r) 
r 

= h\ 
= h\ 
= h 2 . 



(4.1 
(4.2 
(4.3 
(4.4 
(4.5 

(4.6 

(4.7 
(4.8 
(4.9 
(4.10 



We now detail these calculations to complete the proof. They are all based on 



the formula given in Proposition IA.1I in the Appendix and repeated use of the 
residue theorem. 



Proof of (j4~Tj) : We have 



h(f,r) 



1 

2m j i£i=x 
1 

2m' j i£i =1 



/(ii + ^i 2 : 

log(|l + ^| 2 



i i 

£2 _ r -2 £ 



c 2 



1 



-L / (1 io g ((i + ^o 2 ) + \ io g ((i + ^r 1 ) 2 ) 



2tu /j £ | =1 2 
1 



2?ri 



£1=1 



log(l + K) 



l€l=i 



log(l + /iO|^ 



l€l=i 



i 



i og (i + fcr 1 )-^ 

lfl=l ? 



For the first integral, note that as r > 1, the poles are ±- and we have by the 
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residue theorem, 



2ni j\(\ = x 
log(l + h$ ■ £ 



1 h 2 



, log(l + 



For the second integral, 

iog(i+^)irfe=iog(i+/ie)| e=0 = o. 
=1 t 



i 



ISI=i 



The third one is 



2ni 



ISI=l 



2tt« 
1 



1*1=1 



log(l + frz) 



- 1 -1 



z 



dz 



log(l + /iz)r 2 _ log(l + /iz)r 2 



= , 



z=0 



27ri J\ z \ = i z(z + r)(z — r) {z + r)(z — r) 

where the first equality results from the change of variable z = j , and the third 
equality holds because r > 1, so the only pole is z = 0. 
The fourth one equals 

^-(f log(l + ^r 1 )7^ = -^r / log(l + hz)^-dz 
2m 4 27ri .71*1=1 z 



'ISI=i 
log(l + /iz)l =0 



Collecting the four integrals leads to the desired formula for Ji(/,r). 



Proof of g2J): We have 
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h(9,r) = ^~<f s(|1 + ^| 2 )[t^-7-7]^ 



lfl=i 



2m 

^-.i |i + ^l 2 

2™ /|f|=i 

1 
2iTi 



L £ 2 



i«i=i 



i 

(1 + ^)72-^-2^- 



+ ^ (1 + ^)- 

41=1 ; 



l + fe£ 
41=1 « 



These four integrals are calculated as follows: 



1 
2~7ri 



I4l=i 
I /' 1 + h£ 



(1 + h0 £ «=™ 

£2_ r ^ £ ^ r 1 



(1 + 



4= 



£_ r -l 



2?ri 



141=1 



d£, = 1 + fa; 



1 



277 /./. 1 ^-r" 2 "* 



'I4l=i 



4=0 

1 / J + h 

141=1 C 



£ - r- 1 e=-r-! £ + r _1 
1 + 1 



= 1 

,-1 

t + h 



27TI Jj S | =1 £ 27TI 7| ? | =1 £ 

Collecting the four terms leads to Ii{g, r) = 0. 



= 1 



4=o 



Proof of fOJ): 
£(/) : 



1 



1 



2;; ^ hi log(|l + ^| 2 )^ 



1 



4l=i 



log(l + hQ 

a 3 



alt 



41=1 



io g (i + h^ 1 ) 
a 3 



alt 



We have 



1 

2~7ri 
1 



log(l + 



41=1 e- 
logci + ^r 1 ) 

141=1 



1 a 2 
2 



,3 



1 

~2~7ri 



= --ft 2 : 
5=o 2 

log(l + /iz) -1 

i ' T2" 

| 2 |=i 73- z 
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Proof of gTj) 



1 / (1 + &£)(! + , t _ 1 / £ + h£ 2 + h + h 2 £ 



Proof of dUSJ): 



We have, 



1 /' log(l + ^)^ i + J_/ y + ff^. 



The first term 

27r«/| £l | =1 (fi -r£ 2 ) 2 

because for fixed |£ 2 | = 1, I^Cal = l r l > lj so r £ 2 is not a pole. 
The second term is 

1 jf logil + h^ 1 ) 1 f log(l + hz) -1 



2™j\ 6l \=i (a-r-6) 2 S 27rf jrf,| =1 (i - r&)a z 2 

i i r io g (i + hz) , i a , . , , 

= ' TTyI r ~t — TT2 = 7TT2 ' «~~ + 



where the first equality results from the change of variable z = and the 
third equality holds because for fixed |£ 2 | = 1, 1 7^ I = < 1, s ° 7^ is a pole 
of second order. 
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Therefore, 

Mf>9,r) 



2nir 2 



(l + h&)(l + h&) 



£a|=l 6(6 + 7) 



i> / 6 + Hi + h + /i 2 6 



2mr 2 J^\=i 6 2 (6 + 7 ; 



rf6 



2irir 2 



1 + h 2 



I6l=i 6(6 



-d6 



161=1 6 



-^6 



2,, , M^ 2 



I6i=i £ 2 (6 + 7 



Finally we find Ji(/, g, r) = \ since 
ft /" 1 + ft 2 



, t-^6 = , 

27m- 2 j^ 2|=1 £ 2 (6 + £) 



27rir 2 /j C2 | =1 £ 2 



0/ gU): 



Proof 
JiUJ.r) = 



h 



/(|1 



r /(|i + ft6l 2 ) , r 
(6-6) 2 ^ 6 

■d6 



^7 / 

27TI 7j fe | =1 

-fe6| 2 ) 7-7-7- ,, 

r6( r 6 + «) 

log(l + h^ 1 ) 

— T— «?2 



27rir 2 ^ 2 | =1 6(^+6) ^ir 2 J^ =1 £ 2 (; 



7+6) 



;2 ■ 



We have 



and 



log(i + ft6) 

2™- 2 J[ £2 | =1 6(1+6) ta 



log(i + h& 



7 + 6 



1, , ^ 

-log 1-- 

r r 



log(l + ft^" 1 ) 



log(l + /16) 



?2=0 



6 = 



2mr 2 J^ =1 £ 2 (£+£ 2 ) 
I /" log(l + hz) 



<*6 



27rir 2 



log(l + hz) -1 



i(2* + i) 



2-Kir 



\z\ = l 



dz = 
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where the first equality results from the change of variable z = ^ , and the third 
equality holds because |? | > 1, so £ is not a pole. 
Finally, we find J t (J, /, r) = -± log(l - £) . 



Proof of (H71) : 



1 



2m 



Ji(9,9,r) = —<b + 



We have 



since 



2?ri /| ?1 |=i (6 - r£ 2 ) 2 



72 — 



27T* (6 - r£ 2 ) 2 27TZ jfj ?1 | =1 6(6 - r&) 



f 



l + /i 2 



6M fifci-rGO 9 * 1 



r 2 & ' 



1 + h 2 



1 



2vri J| Sl |=i (6 - r£ 2 



r^i = , — 



2?ri 7| ?1 |=i (6 - r£ 2 ) : 



-rfa = 



da 



2^7| 6 | =i a(a-^2) 2 ~ «i-r£ 2 ) 



6=o 



r 2 Cl ' 



where the equality above holds because for fixed |^| = 1, 1^2 1 = l r l > 1 ; so 
is not a pole. Therefore, 

h f & + h£$ + h + h 2 & 



2irir 2 
h 

2ixir 2 



1 + h 



161=1 



72— rf 6 

S2 



-^2 
/? 



|^|=i 6 J|6|=i £2 



r^2 
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Proof of glj) gJJ) BlOl) : We have 

log(|l + ^i| 2 ) 



2ni 



161=1 



27TJ 



161=1 



1 

2~7ri 



i«ii=i £1 



2-1 



since 



I6l=i 4i o?i v 



1 
2~7ri 

27ri.^ f ,i_ n ^ 



6=0 



log^l + ^r 1 ) 1 
— "SI = — 



161=1 

— <b log(l + = 



log(l + hz) 1 



2t™ 



i=i 



Similarly, 
1 



3(|1 + ^2| 2 ),, 1 

161 = 1 «2 27U.7| ?2 | =1 



27ri ./i £ 2 



el 



d£ 2 = h. 



Therefore, 

ML 9) 



1 

27ri 



/(ll + ^ll 2 ),, 1 



61=1 



M9> 9) = 7T- f 

2tti 7|6|=i 



1 




/(II 


+ ^i| 2 ) 


■1 




5(|1 


-f^ii 2 ) 


1 


3 



2tH Jj fe | =1 CI 



61 = 1 



/(|1 + ^2| 2 ) 



dfa = /i 2 , 



^l-7T- 



L i g(|i + fe6l 2 ) 

2vri £ 2 



d£ 2 = ^ 2 



The proof of Lemma |2~T1 is complete. 



4.2. Proof 0/ Lemma [Q 

Let /(x) = x 2 and #(2;) = x. Define C n and i?„ by the decompositions 
P r 

= P f( x ) d ( F nix) ~ F»"(x))+pF»»(f) = C» +pF*»(/) , 
i=l ^ 

i=l 
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Applying Theorem 1 2 . 1 1 given in the Appendix to the pair (f,g), we have 

I Cn \^ N (( Ex A ( coy ( x /' x /) cov (*/>^) 

\B n ) V V EX 9 ) V Cov(X g ,Xf) Cov(X g ,X g ) 

It remains to evaluate the limiting parameters and this results from the following 
calculations: 



h(f,r) 


h 2 




(4.11) 


h(g,r) 


= o, 




(4.12) 


W) 


= h\ 




(4.13) 


h(g) 


= o, 




(4.14) 




2h 2 


+ 2h A 


(4.15) 


Mf,f,r) 


2h A 


+ (2h + 2h 3 ) 2 r 


(4.16) 


J\{g,g,r) 


h 2 




(4.17) 


Mf,g) 


= 2h 2 


+ 2h 4 , 


(4.18) 


Mf,f) 


= (2h 


+ 2h 3 ) 2 , 


(4.19) 




= h 2 . 




(4.20) 



The results (1412)) . (l4~T4|) , (|4T7| and (|4~20l) are exactly the same as those found 
in (|4.2|) . (|4.4|) . (|4.7|) and (|4.10j) in the proof of Lemma l2.ll The remaining results 
are found by similar calculations using Proposition IA.1I in the Appendix and 
their details are omitted (available however in a separated on-line supplementary 
file). 

5. Concluding remarks 

Using recent central limit theorems for eigenvalues of large sample covariance 
matrices, we are able to find new asymptotic distributions for two major pro- 
cedures to test the sphericity of a large-dimensional distribution. Although the 
theory is developed under the scheme p — ¥ oo, n — > oo and p/n — > y > 0, our 
Monte-Carlo study has proved that: on the one hand, both the corrected LRT 
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and the corrected John's test are already very efficient for middle dimension 
such as (p, n) — (96, 128) both in size and power, see Tableland Table |31 and 
on the other hand, the corrected John's test also behaves very well in most of 
"large p, small n" situation, see Table |H 

Three characteristic features emerge from our findings: 

(a) These asymptotic distributions are universal in the sense that they depend 
on the distribution of the observations only through its first four moments; 

(b) The new test procedures improve quickly when either the dimension p or 
the sample size n gets large. In particular, for a given sample size n, higher 
dimensions p lead to better performance of these corrected test statistics. 

(c) The corrected John's test is particularly robust against the dimension 
inflation. Our Monte-Carlo study shows that for a small sample size n = 
64, the test is effective for < p/n < 20. 

In a sense, these new procedures have benefited from the "blessings of di- 
mensionality" . 
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Appendix A: Formula for limiting parameters in the CLT for 
eigenvalues of a sample covariance matrix with 
general fourth moments 



Given a sample covariance matrix S n of dimension p with eigenvalues Ai , 



linear spectral statistics of the form F n (g) = p 1 X)f=i ff(^j) f° r suitable func- 
tions g are of central importance in multivariate analysi s. Such CLT's have 



been successively develop ed si nce the pioneering work o f 



Bai and Silversteinl (|2004i) and 
on the subject. 
The CLT in 



Lvtova and Pasturl ([20091 ) for a recent account 



Jonssonl (|1982l ). see 



Bai and Silverstein 



Bai and Silversteinl (|2004l ) (see also an improved version in 



2010)) has been widely used in applications as this CLT 



also provides, for the first time, explicit formula for the mean and covariance 
parameters of the normal limiting distribution. In the special case with an ar- 
ray {xij} of independent variables, this CLT assumes the following moment 
conditions: 

(a) For each n, = ; , i ^ p, j ^ n are independent. 

(b) Exij = 0, E\xij | 2 = 1, maxj^n E\xij | 4 < oo. 

(c) If {xijYs are real, then Exfj — 3; otherwise (complex variables), Ex\i = 
and E\xij\ 4 — 2. 

In Condition (c) , the fourth moments of the entries are set to the values 3 or 
2 matching the normal case. This is indeed a quite demanding and restrictive 
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2') 



condition since in the real case for example, it is incredibly hard to find a non- 
normal distribution with mean 0, variance 1 and fourth moment equaling 3. As 
a consequence, most of if not all applications published in the literature using 
this CLT assumes a normal distr i bution for the observations. Rec ently , effort 
have been made in 



Pan and Zhoul (|2008[ ) . iLvtova and Pastuij (j2009t ) and 



Zheng 



( 20121 ) to overcome these moment restrictions. We present below such a CLT 
with general forth moments that will be used for the sphericity test. 

In all the following, we use an indicator k set to 2 when {xij} are real and to 
1 when they are complex. Define /3 — E\xij | 4 — 1 — k for both cases and h = y/y. 

For the presentation of the results, let be the sample covariance matrix S n — 
— y^Lj XjXf where Xi = (xki)i<k<p is the i-th observed vector. It is then 
well-known that when p — > oo, n — > oo and p/n — > y > , the distribution of 
its eigenvalues converges to a nonrandom distribution, namely the Marcenko- 
Pastur distribution F v with support [a, b] = [(1 ± ^/y) 2 ] (an additional mass at 
the origin when y > 1). Moreover, the Stieltjes transform m of a companion 
distribution defined by F_ v = (1 — y)So + yF c satisfies an inverse equation for 



z e 



l 

m 



y 



1 + m 

The following CLT is a particular instance of Theorem 1.4 in 
4200J). 



(A.1) 



Pan and Zhou 



Theorem A.l. 



Pan and Zhot 



2008 )] Assume that for each n, the variables 



= xfj ,i ^ p, j ^ n are independent and identically distributed satisfying 
Exij — 0, E\xij\ 2 — 1, -E|iEjj| 4 = /3 + 1 + k < 00 and in case of they are 
complex, Ex 2 a = 0. Assume moreover, 



P 



00, n — > 00, p/iHi/>0 . 



Let fx, - • • fk be functions analytic on an open region containing the support of 
F y . The random vector {X n {f\), ■ ■ ■ X n (fk)} where 



Xn(f)=p{F n (f)-Fy-(f)} 
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:>>0 



converges weakly to a normal vector (Xf 1 , ■ ■ ■ Xf k ) with mean function and co- 
variance function: 



E[X f ] = (« - l)h(f) + PhU) , 
Cov(X f ,X g ) = KJi(f,g)+PMf,g) , 



(A.2) 
(A.3) 



where 
h(f) 

W) 
Ji(f,g) 
Mf,g) 



1 

1 

1 

An 2 



y{m/(l + m)} (z)f(z) 



. dz 

1 — y {m/(l + m)} 2 

y{m/(l + m)} 3 (z)f(z) 
1 — y {m/(l + to)} 2 

f{zi)g(z 2 ) . , , , 

- — ? — r — --^m (zi)m {z 2 )dzidz 2 , 

(m(zi) - m(z 2 )) 2 



(z 2 ) > dz 2 , 



where the integrals are along contours (non overlapping in J\) enclosing the 
support of F v . 

However, concrete applications of this CLT are not easy since the limiting 
parameters are given through those integrals on contours that are only vaguely 
defined. The purpose of this appendix is to go a step further by providing al- 
ternative formula for these limiting parameters. These new formula, presented 
in the following Proposition convert all these integral along the unit circle; they 
are much easier to use for concrete applications, see for instance the proofs of 
Lemma 12.11 and 12.21 in the paper. Furthermore, these formula will be of inde- 
pendent interest for applications other than those developed in this paper. 



Proposition A.l. The limiting parameters in Theorem \A.1\ can be expressed 
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as following: 



h(f) = lira h(f,r) , (A.4) 

W) = ^/ e _/(i 1 + ^i 2 )^' ( A - 5 ) 

Jx (/,<?) = limJx(/, 5 ,r) , (A.6) 

i / /(li + ^il 2 ),, / s(|i + ^ 2 | 2 ) 



Mf,g) = --7-2 f Vl ,2 f - 72 .(A.7) 



with 



Mfi9^ r ) = —7-2 f f u F~Y2 o?iafe ■ 

Proof. We start with the simplest formula / 2 (/) to explain the main argument 
and indeed, the other formula are obtained similarly. The idea is to introduce 
the change of the variable z = 1 + hr£ + hr^ 1 ^ + h 2 with r > 1 but close t o 1 an d 
|£| = 1 (recall h = ^/y). Note that this idea has been employed in lZhena (|2012f ). 
It can be readily checked that when £ runs counterclockwiscly the unit circle, z 
will run a contour C that encloses closely the support interval [a, b] = [(1 ± h) 2 } 
(recall h — y/y). Moreover, by the Eq. (|A.1I) . we have on C 

m = ; — , and dz = h(r — r _1 £ _2 )d£ . 

— 1 + hrf y ' 

Applying this variable change to the formula of I%(f) given in Theorem lA.il we 

have 

If 1 r£ 2 - r^ 1 

This proves the formula (|A.5j) . For ()A.4[) . we have similarly 

= 5? h £ =1 m Yrl 2 e - 1) 1 - r V a de 

r4.l 2?ra - r z ) 

= lim/i(/,r). 
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Formula (|A.7j) for J 2 (/, g) is calculated in a same fashion by observing that we 
have 

I -k \£A« -IU<}«- 

Finally for ()A.6|) . we use two non-overlapping contours defined by Zj = 1 + 
/Wj£j + hrj 1 ^ + h 2 , j — 1,2 where r 2 > r*i > 1. By observing that 



for. 



we find 



1 I I /(*l)ff(«2) 



^2 > n > 
r 2 4- 1 



i 471-2 -/la 1=1 ^2 |=i {m(«i) -m(>2)} 
hr\ hr 2 



2 



(1 + ^a) 2 {l + hr 2 ^ 2 

i jr r /(^i)g(^) 



/•, • /•, > i, 471-2 ^ii=i^i=i {nSi - r 2 6} 2 



• ri > 
r 2 |l 



hm-^ / / /(u + ^i 2 ) g (u+^n, 



The proof is complete. □ 



